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ON THE ROTATION DISTANCE BETWEEN BINARY TREES 



PATRICK DEHORNOY 



Abstract. We develop combinatorial methods for computing the rotation 
distance between binary trees, i.e., equivalently, the flip distance between tri- 
angulations of a polygon. As an application, we prove that, for each n, there 
exist size n trees at distance 2n — 0(^/n). 



If T, T" are finite binary rooted trees, one says that T' is obtained from T by 
one rotation if T" coincides with T except in the neighbourhood of some inner node 
where the branching patterns respectively are 



Under the standard correspondence between trees and bracketed expressions, a ro- 
tation corresponds to moving a pair of brackets using the associativity law. If two 
trees T, T" have the same size (number of inner nodes) , one can always transform T 
to T" using finitely many rotations. The rotation distance dist(T, T") is the min- 
imal number of rotations needed in the transformation, and d(n) will denote the 
maximum of dist(T, T") for T, T" of size n. Then d(n) is the diameter of the nth 
associahedron, the graph K n whose vertices are size n trees and where T and T" 
are adjacent if and only if dist(T, T") is one. 

There exists a one-to-one correspondence between size n trees and triangulations 
of an (n + 2)-gon. Under this correspondence, a rotation in a tree translates into 
a flip of the associated triangulation, i.e., the operation of exchanging diagonals in 
the quadrilateral formed by two adjacent triangles. So d{n) is also the maximal flip 
distance between two triangulations of an (n + 2)-gon. 

In [17j . using a simple counting argument, D. Slcator, R. Tarjan, and W. Thurston 
prove the inequality d(n) ^ 2n — 6 for n ^ 11 and, using an argument of hyperbolic 
geometry, they prove d(n) ^ In — 6 for n N , where N is some ineffective (large) 
integer. A brute force argument gives d(n) — In — 6 for 11 ^ n ^ 19. It is natural 
to conjecture d{n) = 2n — 6 for n ^ 11, and to predict the existence of a combinato- 
rial proof. After [17] , various related questions have been addressed [T3l [TBI E EH , 
or [2] for a general survey, but it seems that no real progress has been made on the 
above conjecture. 

The aim of this paper is to develop combinatorial methods for addressing the 
problem and, more specifically, for proving lower bounds on the rotation distance 
between two trees. At the moment, we have no complete determination of the value 
of d(n), but we establish a lower bound in 2n — 0(^fn) that is valid for each n. 

Theorem. For n = 2m 2 , we have d(n) 2n — 2\/2n + 1. 




and 
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We shall develop two approaches, which correspond to two different ways of 
specifying a rotation in a tree. The first method takes the position of the subtree 
that is rotated into account. This viewpoint naturally leads to introducing a partial 
action of Thompson's group F on trees and to expressing the rotation distance 
between two trees T, T 1 as the length of the element of F that maps T to T' with 
respect to a certain family of generators. This approach is very natural and it easily 
leads to a lower bound in |n + O(l) for d(n). However, due to the lack of control 
on the geometry of the group F, it seems difficult to obtain higher lower bounds in 
this way. 

The second approach takes names, rather than positions, into account: names 
are given to the leaves of the trees, and one specifies a rotation using the names 
of certain leaves that characterize the considered rotation. This approach leads to 
partitioning the associahedron K n into regions separated by sort of discriminant 
curves. Then, one proves that two trees T, T 1 are at distance at least I by showing 
that any path from T to T" through K n necessarily intersects at least i pairwise 
distinct discriminant curves. Progressively refining the approach finally leads to 
the lower bound 2n — 0(^/n). No obstruction a priori forbids to continue up to 
In — 6 but a few more technical ingredients will probably be needed. 

The paper is organized as follows. After setting the framework in Section [TJ we 
develop the approach based on positions in Section [2 and use it to deduce lower 
bounds for d(n) that he inn + 0(1), and then in |n + 0(l). Section [3] presents the 
approach based on names, introducing the so-called covering relation, a convenient 
way of describing the shape of a tree in terms of the names attributed to its leaves. 
This leads to a new proof for a lower bound in |n+0(l). In Section|H we introduce 
collapsing, an operation that consists in erasing some leaves in a tree, and use it to 
improve the previous bound to |n + 0(1). Finally, in Section \5[ applying the same 
method in a more tricky way, we establish the In — 0{yfn) lower bound. 

We use N for the set of all nonnegative integers. 



1.1. Trees. All trees we consider are finite, binary, rooted, and ordered (for each 
inner node: the associated left and right subtrees are identified). We denote by •, 
the tree consisting of a single vertex. If Tq,T\ are trees, Tq a Ti is the tree whose 
left subtree is To and right subtree is T±. The size \T\ of a tree T is the number of 
symbols A in the (unique) expression of T in terms of • and A . Thus • A ((» A ») A ») is 
a typical tree, usually displayed as 



Its decomposition comprises three carets, so its size is 3. 

Certain special trees will play a significant role, namely those such that, for each 
inner node, only one of the associated subtrees may have a positive size. Such a 
tree is completely determined by a sequence of O's and l's, called its spine. 

Definition 1.1. (See Figure [T]) For a a finite sequence of O's and l's, the thin tree 
with spine a, denoted (a), is recursively defined by the rules 



(1.2) (0) = •, (0) = (1) = . A ., and (Qa) = (a) A ., (la) = . A (a> for a £ 0. 



1. Trees, rotations, and triangulations 



(1.1) 
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For instance, the tree of is thin, with spines 100 and 101. Defining the 

spine so that it is not unique may appear surprising, but, in this way, (a) has size n 
for a of length n, which will make statements simpler. 




FIGURE 1. Thin trees, with their spines in bold: from left to right, 
(100101100), which is also (100101101), the right comb of size 5, the 
left comb of size 4, the right zigzag of size 9, the left zigzag of size 8. 

Some particular families of thin trees will often appear in the sequel, namely 

- right combs (111...) and their counterparts left combs (000...), 

- right zigzags (101010...) and left zigzags (010101...) — see Figure [TJ 

1.2. Rotations. For each vertex v (inner node or leaf) in a tree T, there exists a 
unique subtree of T with root in v — see (|2.1|) below for a more formal definition. 
This subtree will be called the v-subtree of T. For instance, if v is the root of T, 
then the ^-subtree of T is T itself. Then we can define the rotations mentioned in 
the introduction as follows. 

Definition 1.2. (See Figure[2]) If T, T" are trees, we say that T" is obtained from T 
by a positive rotation, or, equivalently, that (T, T') is a positive base pair, if there 
exists an inner node v of T such that this w-subtree of T has the form Ti h {T2 A T^) 
and T" is obtained from T by replacing the v-subtree with (Ti A T2) A T3. In this case, 
we say that (T',T) is a negative base pair. 




T 2 T 3 Ti T 2 

FIGURE 2. A base pair: T' is obtained from T by rotating some subtree 
of T that can be expressed asTi A (T 2 A T 3 ) to the corresponding (Ti A T 2 ) A T 3 
(positive rotation) — or vice versa (negative rotation). 

By construction, rotations preserve the size of a tree. Conversely, it is easy to 
see that, if T and T 1 are trees with the same size, there exist a finite sequence 
of rotations that transforms T into T" — see for instance Remark 12.161 below — so a 
natural notion of distance appears. 
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Definition 1.3. If T,T' are equal size trees, the rotation distance between T 
and T", denoted dist(T, T"), is the minimal number of rotations needed to trans- 
form T into T". For n ^ 1, we define d(n) to be the maximum of dist(T, T') for 
T, V of size n. 

By definition, we have dist(T, T') = 1 if and only if (T, T") is a base pair. As 
proved in [T7J, Lemma 2], the inequality distfT, T') ^ 2n — 6 holds for all size n 
trees T, T" for n > 10, and, therefore, we havqj d(n) ^ 2n — 6 for n > 10. 

1.3. Triangulations. As explained in [T7], there exists a one-to-one correspon- 
dence between the triangulations of an {n + 2)-gon and size n trees: having chosen 
a distinguished edge, one encodes a triangulation by the dual graph, a tree that 
becomes rooted once a distinguished edge has been fixed (see Figure [3]). Then 
performing one rotation corresponds to performing one flip in the associated trian- 



gulations, this meaning that some pattern 




is replaced with 




,i.e., 



the diagonals are exchanged in the quadrilateral made by two adjacent triangles. 
So the rotation distance between two trees of size n is also the flip distance between 
the corresponding triangulations of an (n + 2)-gon, and the number d(n) is the 
maximal flip distance between two triangulations of an (n + 2)-gon. 





O 



/ \ 



FIGURE 3. Coding a triangulation of an (n + 2)-gon by a size n tree: 
choose a distinguished edge E, and hang the graph dual to the triangulation 
under the vertex corresponding to E. 



1.4. Associahedra. For each n, we have a binary relation on size n trees, namely 
being at rotation distance 1. It is natural to introduce the graph of this relation. 

Definition 1.4. (See Figure 2J) For n ^ 1, the associahedron K n is the (unori- 
ented) graph whose vertices are size n trees and whose edges are base pairs. 

The number of vertices of K n is ^- ( 2 r ™) , the nth Catalan number, and every 
vertex in K n has degree n — 1. The fact that any two trees of the same size are 
connected by a sequence of rotations means that K n is a connected graph. For all 
size n trees T,T', the number dist(T, T 1 ) is the edge-distance between T and T' 
in K n , and the number d(n) is the diameter of K n . 

The name "associahedron" stems from the fact that, when we decompose trees 
as iterated A -products, performing a rotation at v means applying the associativity 
law x^(y A z) — (x A y)^z to the w-subtree. 

Taking the sign of rotations into account, i.e., distinguishing whether associa- 
tivity is applied from x (y z) to (x y) A z, or in the other direction, amounts to 

^contrary to [17] . where notation changes from Section 2.3, we stick to the convention that n 
(and not n — 2) denotes the size of the reference trees 
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FIGURE 4. The associahedron if4: the vertices are the fourteen trees of 
size 4, or, equivalently, the fourteen triangulations of a hexagon, and the 
edges connect trees lying at rotation distance 1 or, equivalently, triangula- 
tions lying at flip distance 1. The diameter d(4) of A'4 is 4. The thin trees 
(1100) and (0011) (framed) are typical examples of size 4 trees at dis- 
tance 4. 



orienting the edges of associahedra, as shown in Figure [5] below. This orientation 
defines a partial ordering on K n , which admits the right comb (1™) as a minimum 
and the left comb (0 n ) as a maximum. This partial ordering is known to be a 
lattice, the Tamari lattice |T9l [lOl [H [18] . Let us mention that alternative lattice 
orderings on K n are constructed in fTS] and [12] . 



2. Using positions 

Hereafter we address the problem of establishing lower bound for the rotation 
distance dist(T, T") between two trees of size n, i.e., to prove that any path from T 
to T' through K n has length at least I for some £. Any such result immediately 
implies a lower bound for the diameter of the corresponding associahedron. 

To this end, we have to analyze the rotations that lead from T to T" and, for that, 
we need a way to specify a rotation precisely. Exactly as in the case of permutations 
and their decompositions into transpositions (instances of commutativity), we can 
specify a rotation (i.e., an instance of associativity) by taking into account either the 
position where the rotation occurs, or the names of the elements that are rotated. 
In this section, we develop the first approach, based on positions. 

2.1. The address of a rotation. The position of a vertex v in a tree T can be 
unambiguously specified using a finite sequence of 0's and l's that describes the 
path from the root of T to v, using for forking to the left and 1 for forking to 
the right. Such a sequence will be called a (binary) address. The set of all binary 
addresses will be denoted by A. The address of the root is the empty sequence, 
denoted 0. So, for instance, the addresses of the three inner nodes of the tree 

of (|l.ip are 0, 1, 10, whereas the addresses of its four leaves are 0, 100, 101, and 11: 






100 101 
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We deduce a natural indexation of subtrees by addresses. For T a tree and a a 
sufficiently short binary address, the ath subtree ofT, denoted Tr a \, is the subtree 
of T whose root is the vertex that has address a. Formally, T( a \ is recursively 
defined by the rules 



(2-1) 



T, 



(0) 



T for every T, and 



for T : 



T(0a) — Tb (a) 
T(la) = T 1(a) 

Note that T( a ) exists and has positive size if and only if a is the address of an inner 
node of T, and it exists and has size if and only if a is the address of a leaf of T. 
With such an indexation, we naturally attach an address with each rotation. 

Definition 2.1. We say that a positive base pair (T, T') has address a + if a is 
the address in T (and in T") of the root of the subtree that is rotated between T 
and T' . The address of the symmetric pair (T' , T) is declared to be a~ . 

For instance, in the positive base pair 

T T 



(2.2) 



the rotation involves the subtrees of T and T" whose roots have address 10, hence 
the address of (T,T r ) is declared to be 10 + . See Figure [5] for more examples. 





10 H 



1+ 



7x- 

|i+ 



J11+ Jl+ |0+ 

— (— + 1 

i+ ^ 



or 



FIGURE 5. Orienting the edges of the associahedron K4 yields the Tamari 
lattice, in which the right comb (1111) is minimal, and the left comb (0000) 
is maximal. Then taking positions into account provides for each edge a 
label that is a binary address. 



The idea we shall develop in the sequel is to obtain lower bounds dist(T, T') ^ I 
by proving that at least t addresses of some prescribed type necessarily occur in any 
sequence of base pairs connecting T to T", i.e., in any path from T to T" through 
the corresponding associahedron. 
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2.2. Connection with Thompson's group F. To implement the above idea, it 
is convenient to view rotations as a partial action of Thompson's group F on trees. 

We recall from [3] that Thompson's group F consists of all increasing piecewise 
linear self-homeomorphisms of [0, 1] with dyadic slopes and discontinuities of the 
derivative at dyadic points. There is a simple correspondence between the elements 
of F and pairs of trees of equal size. 

Definition 2.2. For p q in R and T a size n tree, we recursively define a 
partition [p, q\j< of the real interval [p, q] into n + 1 adjacent intervals by 

(2.3) M. ={&>,«]}, and [p, «]r = fo. U g] Tl - 

Then, for T, T' trees with equal sizes, we define $(T, T') to be the element of F 
that homothctically maps the zth interval of [0, 1]t to the ith interval of [0, 1]t' 
for each i. 

Example 2.3. Assume T = (11) and T = (00). The partitions of [0, 1] respectively 
associated with T and T" are 

[0, 1] T = {[0, ±], [1, |, [J, 1]}, and [0, 1] T , = {[0, J], [1 ±], [± 1]}. 

Therefore, $(T, T') is the element of Thompson's group F whose graph is 



(2.4) 




i.e., the element denoted A in [3] (or in most recent references). 

Infinitely many different pairs of trees represent a given element of F: if T and 
T" are obtained from To and Tq respectively by replacing the fcth leaf with a caret, 
then the subdivisions [0, 1]t and [0, 1]t' are obtained from [0, l]r and [0, 1]^ by 
subdividing the fcth intervals, and we have $(T, T") = 3>(To, Tq). However, for each 
element g of F, there exists a unique reduced pair of trees (r(g), t' (g)) satisfying 
&(r(g), T'(g)) = g, where (T, X") is called reduced if it is obtained from no other 
pair (Xb j To) as above [31 Section 2]. 

We can then introduce a partial action of the group F on trees as follows. 

Definition 2.4. For T a tree and g an element of the group F, we define T . g to 
be the unique tree T' satisfying $(T, T') = g, if if exists. 

This is a partial action: T . g need not be defined for all T and g. By construc- 
tion, T • g exists if and only if the partition [0, 1]t refines the partition [0, l] T (g) ; 
i.e., equivalently, if and only if the tree r(g) is included in T. However, for each 
pair of trees of equal size (T, T"), there exists an element g satisfying T . g = T', 
namely $(T, T'), and the rules for an action on the right are obeyed: if T . g and 
(T . g) . h are defined, then T . {gh) is defined and it is equal to (T • g) • h — we 
assume that the product in F corresponds to reverse composition: gh means "<?, 
then h" . For our purpose, the main point is the following direct consequence of 
Definition |2"U 
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Lemma 2.5. The partial action of F on trees is free: ifT.g andT.g' are defined 
and equal, then g = g' holds. 

2.3. The generators A a of the group F. It is now easy to describe the rotations 
with address a in terms of the action of an element of F. 

Definition 2.6. For each address a in A, we put A a = (aOO)). The 

family of all elements A a is denoted A. 

Example 2.7. The element A is $((11), (00)), hence it is the element A (or Xq) 
considered in Example 12.31 More generally, if a is T, i.e., a consists of 1 repeated 
i times, A a is the element usually denoted Xi, which corresponds to applying asso- 
ciativity at the ith position on the right branch of the tree. Viewed as a function 
of [0, 1] to itself, A a is the identity on [0, 1 — 2~ l ] and its graph on [1 — 2~ ! , 1] is 
that of xq contracted by a factor 2\ 

By definition, A a corresponds to applying associativity at position a, and, there- 
fore, we have the following equivalence, whose verification is straightforward. 

Lemma 2.8. For all trees T,T' with the same size, the following are equivalent: 

(i) (T,T') is a base pair with address a , 

(ii) T' =T. A^ 1 holds. 

It is well-known that the group F is generated by the elements A and Ai, 
hence, a fortiori, by the whole family A. For g in F, we denote by £a(9) the length 
of g with respect to A, i.e., the length of the shortest expression of g as a product 
of letters A a and A" 1 . Then the connection between the rotation distance and the 
length function £a is very simple. 

Proposition 2.9. For all trees T, T' with the same size, we have 
(2.5) dist(T,T') = £ A ($(T,T')). 

Proof. Assume that (To, is path from T to T' in K\tU i-e., we have To = T, 

Ti — T', and (T r , T r +i) is a base pair for each r. Let afr be the address of (T r , T r+ i). 
By construction, we have 

T.(A e a \...A%J = T', 
hence $(T,T') = A e a \...A e a e f by LemmaES and, therefore, t A (<S>(T,T')) s$ L 

Conversely, assume that A e ^ t —A% is an expression of the element $(T, T') of F 
in terms of the generators A a . It need not be true that T . (A^.-.A^ ) is defined, 
but we can always find an extension T of T (a tree obtained from T by adding more 
carets) such that T . (A^ ...A e £ '■ ) is defined and equal to some extension T' of T'. 
Hence we have dist(T, T') ^ I. Now, anticipating on Section 01 there exists a set I 
such that T and T' are obtained from T and T' respectively by collapsing the labels 
of I. As will follow from LemmaiH this implies dist(T, T') < dist(f , f 7 ) ^ I. □ 

2.4. Presentation of F in terms of the generators A a . We are thus left with 
the question of determining the length of an element of the group F in terms of the 
generators A a . Formally, this problem is closed to similar length problems for which 
solutions are known. In [7] and :T, explicit combinatorial methods for computing 
the length of an element of F with respect to the generating family {xq,xi}, i.e., 
{A ,Ai} with the current notation, are given. Similarly, the unique normal form 
of [21 Theorem 2.5] is geodesic with respect to the generating family {xi \ i ^ 0} and, 
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therefore, there exists an explicit combinatorial method for computing the length 
with respect to that generating family, i.e., with respect to {A , A\, An, ...}. Thus, 
one might expect a similar method for computing the length with respect to the 
family A of all A a 's. Unfortunately, no such method is known at the moment, and 
we can only obtain coarse inequalities. 

The first step is to determine a presentation of the group F in terms of the 
(redundant) family A. As we know a presentation of F from the x^s, it is sufficient 
to add the definitions of the elements A a for a not a power of 1. Actually, a much 
more symmetric presentation exists. 

If a, (3 are binary addresses, we say that a is a prefix of (3, denoted a E 0, if 
f3 = aj holds for some address 7. 

Lemma 2.10 ([H Prop. 4] or [5, Prop. 2.13]). In terms of A, the group F is 
presented by the following relations, where a, (3 range over A, 



(2.6) A 2 a = Aai ■ A a ■ A a0 , 

(2.7) A a op ■ A a = A a ■ AaQop, 

(2.8) A a io/3 ■ A a — A a ■ A q0 i/3, 

(2.9) A a np ■ A a = A a ■ A a ifj, 

(2.10) Ap ■ A a — A a ■ Ap if neither a E (3 nor (3 E a holds. 



Relations (|2.6|) are MacLane-Stasheff pentagon relations, whereas (|2.7|) - (|2.10|) 
are quasi-commutation relations with an easy geometric meaning. 

For a given pair of trees (T, T"), it is not difficult to find an expression of $(T, T') 
by an ^4 Q -word, i.e., a word in the letters A^ 1 — see Remark l2.16l below. In general, 
nothing guarantees that this expression w is geodesic, and we can only deduce an 
upper bound dist(T, T") ^ |to|. However, we can also obtain lower bounds by using 
invariants of the relations of Lemma 12.101 

Lemma 2.11. For w an A a -word, let I(w) be the algebraic sum of the exponents 
of the letters A a with a containing no 0. Then I is invariant under the relations 
of Lemma \2.10\ 

Proof. A simple inspection. For instance, if a contains no 0, there are two letters 
of the form A^ on both sides of (|2.6p . whereas, if a contains at least one 0, there 
are no such letter on either side. Also notice that the invariance property holds for 
the implicit free group relations A a ■ A^ 1 = 1 and A~ x ■ A a — 1. □ 

Proposition 2.12. An A a -word that only contains letters Ax* with positive expo- 
nents is geodesic. 

Proof. Using \w\ for the length (number of letters) of a word w, we have I{w) 
\w\ for every Ac-word w, by definition of /. Now, if w contains only letters A^ 
with positive exponents, then we have I(w) = \w\. By Lemma I2.11| we deduce 
\w\ = I(w) = I{w') ^ \w'\ for each yl Q -word that is equivalent to w under the 
relations of Lemma 12.101 □ 

Remark 2.13. By contrast, it is not true that a positive v4 Q -word, i.e., an A a - 
word in which all letters have positive exponents (no A" 1 ) need to be geodesic, or 
even quasi-geodesic: for p 1, let us write A^ for A alP -iA alP -2...A a iA a . Then 
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one can check that, for each p, the positive A a -word 

A (p) 4 (p-1) a(p~ 2 ) 4(2) 4(1) 
^01 ^-0101 •■■ ^"-(oi)p- 2 (oi)p- 1 ' 

which has length p(p + l)/2, is equivalent to the v4 a -word 

^0 ^^0^0 1 ^01^ 10 ■■• ^(01)p- 2 ^( 1 i)p-2 ^(01)p- 1 j 

which has length 3p — 2. In other words, the submonoid of F generated by the 
elements A a is not quasi-isometrically embedded in the group F. 

As an application, we can determine the distance from a right comb to any tree. 

Proposition 2.14. For each tree T of size n, we have 

(2.11) dist«l n ),T) =n-h R (T), 

where h H denotes the length of the rightmost branch. 

Proof. Using shi for the word homomorphism that maps A a to A\ a for each a, we 
recursively define an j4 Q -word wt by 



(2.12) w T 



e (the empty word) for T = •, 

w Tq ■ A 1 b R {T Q )-i...AiA S3 ■ shi(w Tl ) for T = T A Ti. 



An easy induction shows that, for each size n tree T, the length of wt is n — h H (T), 
and that we have (1") • wt = T, i.e., wt provides a distinguished way to go from 
the right comb (1") to T. 

Now, we see on (|2.12l) that wt exclusively consists of letters A]* with a positive 
exponent. Hence, by Proposition 12. 121 % is geodesic, and we deduce 

dist«l"),T) = K| = n-h R (T). □ 
Corollary 2.15. For each n, we have d(n) ^ n — 1. 

Proof. Applying (|2.1ip when T is the left comb (0™) — or any size n term with right 
height 1— gives dist((l n ), T) = n - 1. □ 

As can be expected, other proofs of the previous modest result can be given, 
for instance by counting left-oriented edges in the trees: our purpose in stating 
Proposition 12.141 is just to illustrate the general principle of using Thompson's 
group F and its presentation from the A a 's. 

Remark 2.16. The proof of Proposition 12.141 implies that, for each pair (T,T') 
of size n trees, the A Q -word w^, 1 Wt' is an explicit expression of $(T, T 1 ) of length 
at most 2n — 2. It corresponds to a distinguished path from T to T' via the right 
comb (1™) in the associahedron K n . 

2.5. Addresses of leaves. We turn to another way of using addresses to prove 
lower bounds on the rotation distance, namely analyzing the way the addresses of 
the leaves are modified in rotations. 

Definition 2.17. For T a tree and 1 ^ i ^ \T\ + 1, we denote by addr(i) the 
address of the ith leaf of T in the left-to-right enumeration of leaves. 
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Equivalently, we can attribute labels 1 to n + 1 to the leaves of each size n tree T 
enumerated from left to right and, then, addT(«) is the address where the label i 
occurs in T. For instance, if T is the (thin) tree of then the labelling of the 

leaves of T is 




2 3 

and we find add T (l) = 0, add T (2) = 100, add T (3) = 101, and add T (4) = 11. 

The idea now is that, for each i, we can follow the parameter addy(i) when 
rotations are applied. 

Lemma 2.18. Assume that (T, T") is a base pair and 1 ^ i ^ |T| + 1 holds. Then 
addr'(i) is equal to add-r(i) or it is obtained from addr(i) by one of the following 
transformations, hereafter called special: adding or removing one 0, adding or 
removing one 1, replacing some subword 10 with 01 or vice versa. 

Proof. Assume that the address of (T,T') is a + and addT(i) = 7 holds. Then four 
cases may occur. If a is not a prefix of 7, then we have addT'(i) = 7- Otherwise, 
exactly one of a0 E 7, alO E 7, or all E 7 holds. If aO E 7 holds, say 7 = a0/3, 
then we have addy'(i) = a00/3, obtained from 7 by adding one 0. If alO E 7 holds, 
say 7 = a0/3, then we have addT'(z) = a01/3, obtained from 7 by replacing one 10 
with 01. Finally, if all E 7 holds, say 7 = all/3, then we have addy/(i) = al/3, 
obtained from 7 by removing one 1. The results are symmetric for a negative base 
pair. □ 

So Lemma [2 . 1 81 shows that the parameters addT(«) cannot change too fast, which 
can be easily translated into lower bounds on the rotation distance. For a, 7 two 
binary addresses, we denote by #^7 the number of occurrences of a in 7. 

Lemma 2.19. for 7, 7' in A, define 

-5(7,7') = |# 7' - # 7| + W - #iT| + |# 10 7' - # 10 7|- 
Then, for all trees T, T' of equal size, we have 

(2.13) dist(T,T')^ max 6(&dd T (i), add T ' (»))• 

Proof. Lemma 12.181 shows that, for each z, one rotation changes by at most one 
the value of <5(addT(«), addr'W)- Indeed, adding or removing one can change 
the value of |# Q addT'(«) — # Q addT(i)| by one, but it changes neither [^addT'M — 
# 1 addT(«)| nor |# 10 addT'(«) — # 1Q addT(i) | ■ Similarly, exchanging 10 and 01 once 
can change the value of |# 10 addr'(«) — # 10 a ddT(*)| by one, but it changes neither 
|# Q addT'(i) — # Q addT(i)| nor l^add^'^) — # 1 addT(*)|- Thus, if <5(addT(«), addr'(i)) 
is i, then at least I rotations are needed to transform T into T". □ 

Proposition 2.20. ForT = (Ti A .) A T 2 , with \T X \ = \T 2 \ =p-l, andT' = ((10) p ), 
we have 

(2.14) dist(T,T') ^ 3p- 2. 

Proof. Both T and T" have size 2p. By construction, we have 

add T (p + 1) = 01 and add T '(p + 1) = (10) p . 

We have # Q 01 = #,01 = 1, # 1Q 01 - 0, and # q (10)p = #,(10)? = # 10 (10)p = p, 
whence 6(01, (10) p ) = 3p-2, and (l2~14l) follows from (|2~T5)l . □ 
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Corollary 2.21. For each n, we have d(n) ^ |n 



3„ _ 5 

2 1 



Proof. Proposition 12.201 gives d(n) |n — 2 for n even. A similar argument gives 
d(n) ^ |n - | for n odd. □ 

2.6. Addresses of leaves (continued). The method can be refined to obtain a 
more precise evaluation of the number of rotations needed to transform an address 
into a special given address, typically one of the form l p 9 . For 7 a binary address, 
we denote by ^(7) the unique path in N 2 that starts on N x {0}, finishes on {0} x N 
and contains one edge (0,-1) for each 1 in 7 and one edge (1,0) for each in 7, 
following the order of letters in 7; see Figure |6l Then we have the following result. 

Lemma 2.22. (See Figured) For 7 satisfying ^7 ^ p and ^7 q, put 

(2.15) f{p, q, 7) = (P - #f7) + (q - # 7) + A( 7 ) + D( 7 ), 

where N(j) denotes the number of squares lying below ^(7) and adjacent to a co- 
ordinate axis, and D(j) is the distance in the N 2 -grid from (1,1) to the region 
above Then, for all trees T,T' satisfying addy(i) = l p q , ^addx'ii) ^ p, 

and # Q addT'(*) ^ q, we have 

(2.16) dist(T,T') ^/(p,g,add T '(i)) 



P 













q 

1 1 > 



FIGURE 6. Application of Lemma [2T221 to 7 = 101001001 with p = 4 
and q = 7: we find p - #fl = 4- 4 = 0, q - # () 7 = 7-5 = 2, 
the number of squares below 71(7) touching one of the axes is 7, and the 
distance from (1,1) to the region above 7r(7) is 1, leading to /(4, 7, 7) = 
+ 2 + 7 + 1 = 10. Hence, if T and T' are trees in which, for some i, the 
ith leaf has address 1 4 7 in T and 7 in T', we have dist(T,T') ^ 10. 



Proof (sketch). As in Lemma \2. 191 one checks that f(p, 9,7) decreases by at most 
one when a special transformation is applied to 7. By Lemma 12.181 this implies 
that /(p, q, addr' (?)) decreases by at most one when a rotation is applied to T". □ 

Proposition 2.23. For T = (P0 9 ) ; T = (0 9 P) with p,q^l, we have 
(2.17) dist(T, T') > p + q + min(p, q) - 2. 

Proof. As FigureHUwill show, we have add T (p + 1) = 1P 9 and add T /(p + 1) = q V, 
so applying Lemma r2. 221 gives the lower bound dist(T, T") p+q+min(p, q) — 2. □ 

(It is then easy to check that (|2.17p is an equality by finding an explicit path 
from T to T' involving the expected number of rotations.) So we reobtain for d(n) a 
lower bound — 2 for even n, and |n— | for odd n. More refinements are possible, 
but approaches that only take one address at a time into account are unlikely to 
go beyond In + 0(1). 
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3. Using names 

As the previous approach based on positions leads to limited results only, we 
now develop an alternative approach based on names. This is exactly similar to 
investigating a permutation not in terms of the positions of the elements that are 
permuted, but in terms of the names of the elements that have been permuted. Here 
we shall associate with every base pair a name (a, b, c, d^ consisting of four numbers 
and a sign. The principle for using names will be the same as with positions: 
we prove dist(T, T") ^ I by showing that any path from T to T' through the 
associahedron Km must visit at lest £ edges whose names satisfy some specific 
constraints. Here the constraints will involve the so-called covering relation, a 
binary relation that connects the leaves of the tree, taking the form "the ith leaf is 
covered by the jth leaf in T", denoted i <\t j- The basic observation is that, if the 
ith leaf is not covered by the jth leaf in T, but is covered in T", then any sequence 
of rotations from T to T' must contain a base pair whose name has a certain form, 
namely (a, 6, c, d) + with c — j and a ^ i < b. The rest of the paper consists in 
exploiting this principle in more and more sophisticated ways. 

3.1. The name of a base pair. As in Section [^751 we attach labels to the leaves 
of a tree. For each label i occurring in a tree T, we denote by addT(*) the address 
where i occurs in T. The only difference is that, in view of Section [4j we shall not 
necessarily assume that the labels used for a size n tree are 1 to n + 1 . However we 
always assume that the labels increase from left to right. So, for instance, 

5 6 

is considered to be a legal labeling and, if T is the above tree, we would write 
addx(2) = and addr(6) = 101. Formally, this amounts to hereafter considering 
labeled trees. However, every unlabeled tree is identified with the labeled tree where 
labels are 1, ... , \T\ + 1. 

Switching to labeled trees does not change anything to rotations and the derived 
notions. If T and T' are labeled trees, we say that (T, T') is a base pair if (T, T ) 
is a base pair, where T and T the unlabeled trees underlying T and T', and, in 
addition, the same labels occur in T and T' (necessarily in the same order by our 
convention that labels increase from left to right). As the associativity law does 
not change the order of variables, this definition preserves the connection between 
rotation and associativity. 

We now attach to each base pair (of labeled trees) a name that specifies the 
rotation in terms of the labels of leaves. 

Definition 3.1. (See Figures 171 and ITD1 below.) Assume that (T, T") is a positive 

base pair. Let a + be the address of (T, T"). Then, the name of (T, T"), denoted 

v(T, T'), is defined to be (a, b, c, d) + , where 

a is the unique label in T that satisfies addr (a) = a0 p for some p, 
b is the unique label in T that satisfies addr(6) = alO p for some p, 
c is the unique label in T that satisfies addr(c) = alOP for some p, 
d is the unique label in T that satisfies addr(rf) — al p for some p, 

In this case, the name of (T',T) is defined to be (a, b, c, d)~. For k = 1, ...,4, the 

fcth entry in u(T,T') is denoted v k (T,T'). 
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bed a b c 



FIGURE 7. Name of a base pair of trees: Vr{T,T') and v±{T,T'), i.e., a 
and d, are the names of the extremal leaves in the subtree involved in the 
rotation, whereas Uz(T,T') and vz(T,T'), i.e., b and c, are the names of 
the extremal leaves in the (nested) subtree that is actually moved in the 
rotation. 

Example 3.2. Let us consider the pair of (|2 . 2[) again, namely 



4 5 4 5 



With the default labels, the name of (T, T") is (2, 3, 5, 6) + : indeed, 2 and 6 are the 
extreme labels of the subtree Trim involved in the rotation, whereas 3 and 5 are 
the extreme labels in the subtree T( 10 io) that is actually moved. 

By definition, when T is given, the address of a base pair (T, T") determines its 
name. The converse is true as well: if (T, T") has name (a, 6, c, d) , then the address 
of (T, T') is a , where a is the longest common prefix of addr(a) and addrid), 
hereafter denoted addy (a) a addy (d) . This address is also add^' (a) a addx' (d) . 

3.2. The covering relation. The main tool that will enable us to use names to 
establish lower bounds for the rotation distance is a binary relation that provides 
a description of the shape of a tree in terms of the names of its leaves. 

Definition 3.3. (See Figure [8]) If i < j are labels in T, we say that i is covered 
by j in T, denoted i <t j, if there exists a subtree T' of T such that i is a non-final 
label in T" and j is the final label in T'. 

The covering relation <Jt provides a complete description of the tree T. 

Proposition 3.4. Every labeled tree T is determined by its covering relation <]y. 

Proof. Assume first that T has the default labeling, i.e., the labels are 1 to \T\ + 1. 
By construction, \T\ + 1 is the largest label in T, and every label is covered 
by \T\ + 1, so <t determines \T\. For an induction, it suffices now to show that, for 
T = Xb A Ti, the relation <\t determines the size |Tn| of To. Now 1 is covered in T 
by | To | + 1, the final label in To, but it is not covered by |To| + 2, nor is it cither 
covered by any integer ^|T|. So we have |T | = max-jj ^ |T| | 1 <t j}- 

The argument is similar for an arbitrary labeled tree, which is determined by its 
shape plus the family of its labels. □ 
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i < T 3 i >* T j 



FIGURE 8. The covering and co-covering relations: i is covered by j in T 
if there exists a subtree T' such that i is a non-final label in T' , whereas is 
the last (rightmost) label in T' . Symmetrically, i co-covers j in T if there 
exists a subtree T' such that i is a the initial (leftmost) label in T', and j 
is a non-initial label in T' . 

According to the notation of (|2.1|) , each subtree of a tree is specified by a binary 
address. Introducing the address of the subtree T' involved in Definition 13.31 gives 
the following rewording of the definition. We recall that, if a and (3 are binary 
addresses, a E f3 means that a is a prefix of (3, i.e., [3 consists of a possibly followed 
by additional O's and l's. 

Lemma 3.5. If i,j are labels in T , then i is covered by j in T if and only if there 
exists a binary address 7 satisfying 

(3.2) 7O E addT(«) and add^O) = 7l p for some p ^ 1. 

For future reference, we mention some simple properties of the covering relation. 

Lemma 3.6. (i) The set of labels covered by j in T is a (possibly empty) interval 
ending in j — X: if i <t j and i ^ i' < j hold, then so does i' <\t j ■ 
(ii) The relation <y is transitive. 

Proof. Point (i) is clear from the definition: the elements covered by j in T are the 
non-final labels in the maximal subtree T' of T that admits j as its final label: this 
means that T' is either T itself, or it is some subtree T(~\ such that 7 ends with 0. 

For (ii), we observe that, if T' is a subtree of T witnessing for j <t k, and T" 
is a subtree witnessing for i <\t j, then the subtrees T' and T" cannot be disjoint 
since j is the label of a leaf that belongs to both of them, and, then, T" must be 
included in T' , so each non-final label in T" is a non-final label in T" as well. □ 

3.3. The co-covering relation. The definition of covering gives a distinguished 
role to the right side. Of course, there is a symmetric version involving the left side. 

Definition 3.7. (See Figured!) If i < j are labels in T, we say that i co-covers j 
in T, denoted i \>* T j, if, for some subtree T" of T, the integer i is the initial label 
in T' , and j is a non-initial label in T'. 

To avoid confusion, we shall always state the covering and co-covering properties 
for increasing pairs of labels i < j, thus saying u i is covered by j" rather than 11 j 
covers i" , and u i co-covers j" rather than "j is co-covered by i" . The counterpart 
of (|3 . 2|) is that i co-covers j in T if and only if there exists 7 satisfying 

(3.3) add*r(«) = 70 p for some p ^ 1 and 7I c addrO'). 

The counterparts of Lemmas 13.51 and 13.61 are obviously true. 

More interesting are the relations that connect covering and co-covering. 
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Lemma 3.8. Assume i < k ^ j < £. Then the relations i <\t j and k t>* T I exclude 
each other. 

Proof. Assume i < k ^ j , i <Jt j and k t>* T I. We claim that I ^ j holds. Indeed, 
let T( a ) be a subtree of T witnessing for i <\t j, and be a subtree witnessing 
for k [>* T I. According to (|3.2[) and (|3.3p . we have 

aO E add z (i), add T (j) = al p , add T (fc) = [30 q , and /31 E add T (£) 

for some p, g ^ 1. If a lies on the left of (3, then al p lies on the left of [30 q as well, 
contradicting the hypothesis k ^ j. If a lies on the right of [3, then addT(i) lies on 
the right of f30 q as well, contradicting the hypothesis i < k. So a and j3 must be 
comparable, i.e., one of the subtrees Tt a s,T/p\ is included in the other. The only 
case that makes k ^ j possible is Trp\ being strictly included in T( Q ), in which case 
we have i < k ^ I ^ j . □ 

Lemma 3.9. Assume a <\t b. Then the following are equivalent: 

(ii) we have addT(a) = 701 p for some p 0, where 7 is addr(a) a addT(&); 
(m) a + 1 \>* T b holds; 

(Hi) a <t i fails for each i in [a + 1, b — 1] . 

Proof. (See Figure [HI) By Lemma |3"31 the hypothesis implies that addr(fe) is 7I 9 
for some positive q. Assume (i). Then addy(a + 1) must be of the form 7l0 r for 
some positive r and, therefore, a+1 co-covers b in T, i.e., (ii) holds. On the other 
hand, a can be covered by no label between a+1 and b — 1, so (Hi) holds. 

Conversely, we have 7 E add T (a + 1) a add T (6) under the hypotheses. Hence, if 
(ii) holds, then addi^a + 1) must be of the form 7l0 r , in which case add^(a) must 
be of the form 701 p for some positive p. Hence, by the counterpart of Lemma l3.5i 
(ii) holds. 

Finally, if (i) fails, the address of a in T must be of the form 7OPO7' for some 
nonnegative p and some 7'. But, then, there exists a label c > a whose address has 
the form 701 r , and we have a <\t c, so (Hi) fails. □ 

a a+1 b a c b 

FIGURE 9. Proof of Lemma l3^9l on the left, the case when (i), (ii), (Hi) 
hold; on the right, the case when they fail. 

3.4. The Key Lemma. We arrive at the main point, namely analyzing the influ- 
ence of rotations on the covering and co-covering relations. The result is simple: 
a positive rotation creates some covering and deletes some co-covering, a negative 
rotation does the contrary. The nice point is that there exists a close relation 
between the name of a rotation and the covering or co-covering pairs it creates 
or deletes. This will directly lead to lower bounds on the rotation distance: as 
one rotation only changes the covering and co-covering relations by a small, well 
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controlled amount, if two trees T, T" have very different covering and co-covering 
relations, many rotations are needed to transform T into T' . 

First, we note for further reference the following straighforward facts, whose 
verification should be obvious from Figure [71 We naturally use i <t j for "i <t j 
or i = j" , and similarly for i t>* T j. 

Lemma 3.10. If (T, T') is a base pair with name (a, b, c, d) , we have 

(i) a \>* T i and i <!t d for each label i of T lying in [a, d]; 

(ii) a >* T i and i <y 6—1 for each label i of T lying in [a, b — 1] ; 
(Hi) b t>* T i and i <t c for each label iofT lying in [b, c]; 

(iv) c + 1 >p i and i <t d for each label i of T lying in [c + 1, d]. 

The roles of T and T' in Lemma l3.10l are symmetric, so all covering results stated 
for T also hold for V . 

Here comes the main point, namely the way covering and anticovering change in 
a base pair. 

Lemma 3.11. If(T,T') is a base pair with name (a,b,c,d) + , then, for all labels i, j 
occurring in T , 

(i) i <\t> j holds if and only if we have either i <\j< j , or a ^ i < b and j = c; 

(ii) i t>* T j holds if and only if we have either i t>* T , j , or i = b and c < j ^ d. 

Proof, (i) For i < a, we have i j and i -fir' j for j < d, and i <t j < ^=> i <T' j 
for j ^ d. Similarly, we have i <\t j <=^ > i <t' j f° r d < i < j . So the point is 
to consider the pairs (i,j) with a ^ i < j ^ d. As is clear from Figure nothing 
changes from T to T" when i and j both are in [a, b — 1], or in [6, c], or in [c+ 1, d]. 
For the cases when i lies in one of the intervals and j in another, one sees on the 
figure that the only case when <t and <t' disagree is when i lies in [a,b — 1], 
thus corresponding to a leaf in the left subtree (the one with address aO), and j is 
corresponds to the rightmost leaf in the central tree, which has address alO in T 
and aOl in T'. In this case, we have i f&T j and i <\x> j- 

The case of (ii) is symmetric, exchanging the roles of left and right and the 
inequalities. □ 

So, if (T, T') is a positive base pair, the only difference between the relations <t 
and <t' are that vs(T,T') covers more elements in T" than in T. Symmetrically, 
the relations r>^ and \>* T , are equally close, the only difference being that 1/2 (T, T") 
co-covers less elements in T' than in T. This leads us to a criterion for recognizing 
that certain types of base pairs inevitably occur on any path connecting two trees. 

Notation 3.12. In the sequel, we say "pair (...,...,..., ...) ± " for "base pair with 
name (..., ...) ± ". Also, we use abbreviated notation such as ( , >i , j , ... ) + 
to refer to any pair ( a , b , c , d ) + satisfying a ^ i, b > i, and c = b. 

Lemma 3.13 (Key Lemma). Assume that the trees T,T' satisfy 

(3.4) i j and i <t' j- 

Then each sequence of rotations from T to T' contains a pair ( , >i , j , ... ) + . 

Proof. Let (To, ... , Tg) be a path from T to T' in K\t\ 1 i-s., assume Tq — T, Ti = T' , 
and (T r ,T r+ i) is a base pair for each r. By (|3.4[) . we have i ^t j and i <r e j, so 
there must exist an integer r satisfying 

(3.5) i ^ Tr j and i <r r+1 j- 
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By Lemma l3.11f i). this can occurs only if we have 

Vx(T r ,T r+ i) < i < v 2 {T r , T r+1 ) and ^ 3 (T r , T r+l ) = j. 
In other words, the pair (T r , T" r +i) has the form ( , >i , j , ... ) + . 



□ 



Thus the covering relations partition the associahedra into regions. What Lem- 
ma GTS] says is that one cannot go from one region to another one without crossing 
the border, which corresponds to base pairs with a certain type of name, see Fig- 
ure [TUJ 



l5?34& 2<4 \1<4 

(1,2,4,5/ i 



(2,3,3,4/ 



(2,4,4,-5/. ■ 



2^4&3<|4 
3^4"'- 



(1,2,3,4/, 



(1,2,3,5/ 



^(i,2,2,i/j : |i 



(1,2,2,3/ 



7k- 

1(2,3,3,5/ 

J(3,4,4,5/ |(3,4,4,5/ i i |(1,3,3.4/ 



^(2,3,4,5/ 



(1,2,2,4/" 



(2,3,3,4/ 



(1,2,4,5/ 



FIGURE 10. Names of edges and covering relation in the associahe- 
dron K4. The regions correspond to the various possibilities for covering 
by 4. As i <\ 4 implies j <J 4 for i < j < 4, there are four regions corre- 
sponding to 1 < 4, to 1 5*5 4 and 2 < 4, to 2 4 and 3 <3 4, and to 3 5*3 4. 
Lemma [3.13l savs for instance that the only way to leave the region 3^4 
is to cross a pair named ( ... , 4 , 4 , ... ) + . 



Of course, we have a symmetric statement involving the relation >*. 
Lemma 3.14. Assume that the trees T,T' satisfy 

(3.6) i \>* T j and i \ft* T , j, 

(3.7) or i \>* T j and i — 1 <\x> j — 1. 

Then each sequence of rotations from T to T' contains a pair (...,«, <j , ) + . 

Proof. The statement for Q3.6P is the exact conterpart of Lemma [3~4l when left and 
right are exchanged. As for (|3.7p . Lemma [3.81 says that i— 1 <t' j — 1 implies 
i p^, j, and, therefore, the hypotheses (|3.7|) imply the hypotheses (|3.6p . □ 

3.5. Refinements. More precise criteria will be needed in the sequel, and we shall 
now establish some refinements of Lemmas l3.13l and l3.14l All are based on these ba- 
sic results, but, in addition, they exploit the geometric properties of the relations < 
and >* . The crucial advantage of the criteria below is that they provide stronger 
constraints for the parameters of the involved pairs: for instance, Lemma I3US1 spec- 
ifies two of the four parameters completely ("z + 1 and j"), whereas Lemma [3.131 
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only gives one exact value ("j"), and not more than an inequality for another one 
(">i"). The price to pay for the improvement is a strcngthtcning of the hypotheses 
and, chiefly, a disjunction in the conclusion. 

Lemma 3.15. Assume that the trees T,T' satisfy 

(3.8) i $t j, i <t> j, and i + 1 >* T , j. 

Then each sequence of rotations from T to T' contains a pair ( ... , i + 1 , j , ... ) + , 
or a pair ( . . . , i + 1 , . . . , j ) ~~ . 

Here we shall prove Lemma |3. 151 directly — this can be done as a good exercise — 
but rather derive it from a more elaborate statement. The new refinement consists 
in getting constraints on three parameters at a time. 

Lemma 3.16. Assume i < k < j and the trees T,T' satisfy 

(3.9) i i <T' h an d £ + 1 !>t' j- 

Then each sequence of rotations from T to T' contains a pair ( ^z , i<...^k , j , ... ) + , 
or a pair ( , i<...^fc , ~^k , j )~ ■ 

Proof of Lemma \3.15\ from Lemma \3.1b\ Assume first j > i + 1. Put k = i + 1. 
Then i < b ^ k implies b = i + 1. So Lemma T3.16I guarantees that there is at least 
one pair named 

(<i, i + 1, j, ...)+ or (<i, ^si + 1, 

which is the expected conclusion. 

Assume now j = i + 1. By hypothesis, we have i <^t j and i <It> j, so 
Lemma [3.13l gives a pair ( ^i , >i , j , ... ) + , hence necessarily ( ... , i + 1 , i + 1 , ... ) + 
again of the expected type. □ 

Proof of Lemma \3.1b\ We begin as in the proof of Lemma T3. 131 Let (To, ... , Te) be 
a path from T to T' in Km. By (|3.9|) . we have i ^t j and i <ir e j, so there exists 
a largest integer r satisfying 

(3.10) i fi Tr j and i <t,. +1 j, 

and, as above, this requires vi{T r ,T r+ i) ^ i < V2{T r ,T r+ {) and f 3 (T r , T r+ i) = j, 
so (T r , T r+ i) is a pair ( ^i , >i , j , ... ) + . Write b — V2{T r , T r+ i). 

If b k holds, (T r , T r+ i) is a pair ( , k^...<i , j , ... ) + , and we are done. 

So, from now on, we assume k < b. The hypotheses i < k implies i < 6—1, 
hence i <It,, b — 1 by Lemma l3.10f z). As we have b — 1 < b ^ Vs(T r , T r+ \) — j, we 
deduce 

(3.11) Bxe[k,j-l](i< Tr x) 

On the other hand, by Lemma 13 . 01 the hypothesis i + 1 t>* T , j implies 

(3.12) VzG [fc,j-i](M T , z). 
Therefore, there must exist s ^ r satisfying 

(3.13) 3x G [k,j — l](i <t s x) and Va; G [k,j — l](i ^t s+ i x). 
Choose such a s. Then, for some e in [k,j — 1], we have 

(3.14) i <\T a e and i ^t s+1 e. 
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By Lemma the name of (T s , T s +i) has the form ( 5Ci , >i , e , ... ) _ , hence, 

a fortiori, ( , >i , ^k , ... ) — . 

Moreover, we have s ^ r by construction, so the choice of r implies that i is 
covered by j in T s+ i, and, therefore, by Lemma 13.61 so is e + 1 since we have 
i < k ^ e < j. On the other hand, Lemma 13. lOfa ) gives e + 1 >^ i f4,(T s ,T s+ i), 
and Lemma 13.91 then implies that e + 1 is covered in T s+ \ by no element smaller 
than ^ 4 (T s ,T s+ i). We deduce U4,(T s ,T s +i) < j, and, even, z/ 4 (T s ,T s+ i) e [fc + 
as we have ^4(T s ,T s +x) > V3(T S) T s +i) ^ fc. Now, by Lemma f3. lOlf •i) . i is covered 
by va{T s , T s +i) in T s +i, whereas, by (|3.13p , it is covered by no element of [k, j — 1]. 
It follows that the only possibility is v^(T s ,T s+ i) = j. 

Finally, let / = V2(T s ,T s+ i). We already know that / > i holds, and we claim 
that / ^ k holds as well. Indeed, two cases are possible. For / = i + 1, the 
hypothesis i < k directly implies / k. For / > i + Lemma !3.10f M) implies 
that i is covered by / — 1 in T s+ i, and, therefore, (|3.13p implies that / — 1 cannot 
belong to [k,j — 1]. As / ^ e ^ j — 1 is true by construction, the only possibility 
is / — 1 < k, i.e., f ^ k. 

So (T s ,T s +i) is a pair ( , i<...^k , ^fc , j )~, as expected. □ 

For future reference, we finally mention the right counterpart of Lemma 13.151 It 
can of course be proved by a direct argument, or deduced from the right counterpart 
of Lemma T3. 161 (that we shall not need here). 

Lemma 3.17. Assume that the trees T,T' satisfy 

(3.15) i \>* T j, i <t j — 1, and i $-* T , j. 

Then each sequence of rotations from T to T' contains a pair ( ... , i , j — 1 , ... ) + , 
or a pair ( i , ... , j — 1 , . . . ) ~~ . 

3.6. Application: reproving a lower bound in 3n/2 + 0(1). As a first appli- 
cation of the previous results and a warm-up for the sequel, we shall now reprove 
Proposition 12 . 231 about the distance of "bicombs". 

Proposition 3.18. For T = (P0 9 ) ; T = (0 9 P) with p,q^l, we have 

(3.16) dist(T, T')^p + q + min(p, q) - 2. 




pH P+2 p pfl 

FIGURE 11. The bicombs of Propositions ^. 23l and l3. 181 — here with p = 4 
and g = 6. 

The method of the proof consists in identifying various families of base pairs, and 
to prove, using the results of Sections 13.41 and 13.51 that every sequence of rotations 
from T to T' contains at least a certain number of pairs of these specific types. 
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Proof. Up to a symmetry, we may assume p ^ q. Put n = p + q. Let us say that a 
base pair is special 

- of type I if it is ( ... , a , p+1 , ... ) + or ( ... , a , ... , p+l )~ with 2 ^ a ^ p + 1, 

- o/ type I a if it is ( ... , p+l , a , ... ) + or (p+l , ... , a , ... )~ with p+2 ^ a ^ n, 

- o/ type M a if it is ( 1 , =/=p+l , a , ... ) + with p+2 ^ a ^ n, 

- of type IV Q if it is ( ... , a , ^p , ... ) + with 2 ^ a ^ p. 

First we observe that the various types of special pairs are disjoint, i.e., a special 
pair has one type exactly: the four families are disjoint and, inside each family, the 
parameter a is uniquely determined. 

Let (T , ...,Tt) be a path from T to T in K n . First choose a in [2,p+ 1]. We 
see on Figure [Tl] that p+l covers a — 1 in T", but not in T, and, moreover, that a 
co-covers p + 1 in T. Applying Lcmma l3. 151 with i = a— 1 and j = p + 1 guarantees 
that (To, Ti) contains a pair ( ... , a, p+l, ... ) + or ( ... , a , ... , p + 1 ) _ , i.e., a 
special pair of type I a - Letting a vary from 2 to p + 1 guarantees that (To, Tg) 
contains at least p special pairs of type I. 

Similarly, choose b in [p + 2,n]. We see now that p+l co-covers b in T, but 
not in T', and, that, moreover, p + 1 is not covered by b + 1 in T'. Applying 
Lemma f3 . 1 71 with i = p+l and j = b+1 guarantees that (T , Tg) contains a pair 
( ... , p+ 1 , b , ... ) + or {p + 1 , ... , b , ... )~ , i.e., a special pair of type If,. Letting b 
vary from p + 2 to n guarantees that (To, Tg) contains at least q — 1 special pairs 
of type I. 

Consider b in [p + 2, n] again. Then 1 is covered by p + 1 in T, and not covered 
by p + 1 in T'. Applying Lemma 13.131 with i = 1 and j = b guarantees that 
(To, Ti) contains at least one pair ( 1 , a , b , ... ) + for some a satisfying 2 ^ a ^ b. 
Here two cases are possible. 

Case 1. For each b in [p + 2, n], there is a pair ( 1 , a , b , ... ) + with a ^ p+l, 
or type W a . In this case, letting b vary from p + 2 to n guarantees that (T , T) 
contains at least q — 1 special pairs of type IH. 

Case 2. There exists b such that (To, Tg) contains no special pair of type Mi,. 
Owing to the above observation, this implies that there exists r such that (T r , T r+ i) 
is ( 1 , p + 1 , b , ... ) + . By Lemma R.lOf M). this implies that 1 is covered by p in T r . 
In this case, we claim that there must exist in (To, ...,T r ) a pair of type W a for 
each a in [2,p\. Indeed, consider such an a. The hypothesis that 1 is covered by p 
in T r implies that a — 1 too is covered by p in T r . On the other hand, we see that 
a co-covers p + 1 in T. Applying Lemma 13.141 with i = a and j = p+l guarantees 
that (T), ...,T r ) contains a pair ( ... , a , ^p , >p ) + , hence a special pair of type iV Q . 
So, in this case, (To, Te) contains at least p—1 special pairs of type IV. 

Summarizing, we conclude that (To, ...,T) contains at least 

p special pairs of type I, 
q — 1 special pairs of type H, 
and q — 1 special pairs of type 11 or p — 1 special pairs of type IV, 

hence at least 3p — 2 special pairs. As these pairs are pairwise distinct, the distance 
between To and Tg, i.e., between T and T', is at least 3p — 2. □ 

The previous argument is illustrated in the case of size 4 trees in Figure [T2] 
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FIGURE 12. Proof of Proposition 13.181 Introducing special pairs of vari- 
ous types amounts to colouring the edges of the associahedron K4. In the 
current case, we use four colours, namely I2, I3, H4, and a common colour 
for M4 and IV2, plus a neutral colour for the edges that receive none of the 
previous colours, i.e., for non-special pairs. The proof shows that each path 
from (1100) to (0011) must contain at least one edge of each of the four 
colours — as can be checked on the picture — hence the distance between the 
trees (1100) and (0011) (framed squares) is four. 



4. Collapsing 

The previous method is powerful, but the results obtained so far remain lim- 
ited. In order to establish stronger results, we now add one more ingredient called 
collapsing, which is a certain way of projecting an associahedron K n onto smaller 
associahedra K n > with n' < n. The idea is very simple: collapsing a set of labels / 
in a tree T means erasing all leaves whose labels belong to /, and contracting the 
remaining edges to obtain a well- formed tree collj(T). Collapsing a set of labels I 
in the two entries of a base pair (T, T") yields either a base pair, or twice the same 
tree, in which case we naturally say that the pair (T, T") is /-collapsing. This im- 
plies that the rotation distance between T and T 1 is at least the distance between 
the collapsed trees coll/(T) and coll/(T'), plus the minimal number of inevitable 
/-collapsing pairs between T and T'. This principle will enable us to inductively de- 
termine the distances dist(T p , Tp) for trees T p , T' p such that T p _i is obtained from T p 
and T p _ 1 is obtained from T p by collapsing some set of labels (the same for both). 

4.1. Collapsing. Assume that T is a finite binary tree. For / C N, we consider 
the tree obtained from T by removing all leaves whose labels lie in / (if any). In 
order to include the case when all leaves are removed, we introduce an empty tree 
denoted 0, together with the rules 

(4.1) A T = T, T A = T. 

It is then coherent to declare |0| = — 1. Objects that are either a finite labeled tree 
or the empty tree will be called extended trees. We denote by Lab(T) the family of 
labels occurring in T . 
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Definition 4.1. For I C N and T an extended tree, the I-collapse of T, de- 
noted coll/(T), is recursively defined by 

for T = 0, 

for \T\ = and Lab(T) C I, 

T for \T\ = and Lab(T) % I, 

coll 7 (T 1 ) A coll / (T 2 ) for T = T! A T 2 . 

Example 4.2. Assume T = ((1 A 2) A (3 A 4)). Then we find for instance 

coll {1} (T) = 2 A (3 A 4), coll {2i3} (T) = l A 4, coll {1>2 , 3 ,4}(T) = 0. 

Properties of collapsing are mostly obvious. In particular, it should be clear 
that we always have |coll/(T)| = |T| — #(Lab(T) n I). Also, we have the following 
compatibility with the covering and co-covering relations. 

Lemma 4.3. If i is covered by ( resp. co-covers) j in T , and i,j do not belong to I , 
then i is covered by (resp. co-covers) j in coll/(T). 

Remark 4.4. We took the option not to change the remaining labels when some 
labels are collapsed. So, even we start with a tree T in which the labels are the 
default ones, namely 1 to |T| -I- 1, after collapsing we are likely to obtain a tree T 
in which the labels are not 1 to \T\ + 1. That is why it seems preferable to consider 
general labeled trees, i.e., to allow jumps in the sequence of labels. 

4.2. Collapsing a base pair. For our current purpose, the question is to connect 
the rotation distance between two trees and the rotation distance between their 
images under collapsing. The point is that the collapsing of a base pair is either a 
diagonal pair, i.e., a pair consisting of twice the same tree, or it is still a base pair, 
whose name is easily connected with that of the initial pair. 

Lemma 4.5. For I C N and (T, T") a base pair of name ( a , b , c , dy, 

(i) either we have 

(4.3) [a, 6-1] CI or [b,c]CI or [c+l,d]CI, 

and then coll/(T) and collr(T') coincide, 

(ii) or (coll/ (T), coll/ (I")) * s a base pair of name (a,b,c,d) + with 
a = min([a, b — 1] \ I), b = min([6, c] \ I), c = maxQfo, c] \ I), d = max([c + 1, d] \ I). 

Proof. See Figure [7] If one of the three subtrees T/ a0 \, T^ aW ^, T( Qll ) completely 
vanishes, which happens when at least one of the three inclusions of (|4.3p is true, 
then collj(T) and coll/(T') coincide. Otherwise, i.e., if at least one leaf of each of 
the above three subtrees remains, we have 

coU 7 (T (a) ) - coll / (T (Q0) ) A (coll / (r (al0) ) A coll / (r (Qll) )), 

collj(T ( ' a) ) = (coll / (T (Q0) ) A coll / (r (al0) )) A coll / (T (c[11) ), 

Hence (collr^,-,)), collj(T/»)) is a positive base pair, and so is (collj(T), collj(T')). 
The name should then be clear from the picture □ 

Definition 4.6. A base pair is called I-collapsing if at least one of the three 
conditions of (|4. 3|) is satisfied. If T,T' are trees, the I -distance between T and T", 
denoted I-dist(T, T'), is defined to be the minimal number of /-collapsing steps 
occurring in a sequence of rotations from T to T'. 



(4.2) coll/(T) = 
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A direct application of Lemma l4~5l is the following useful relation: 
Lemma 4.7. For all trees T,T' and all sets I, we have 

(4.4) dist(T, T') > dist(coU/(T), collj(T')) + J-dist (T, T 1 ). 

Proof. Let (To, ... , TJp) be a path from T to T" in Km ■ By Lemma l4~5l the sequence 
(collj(Tb), collj(X^)) is a path from collj(T) to collj(T') in Kjr, and the number 
of nontrival pairs in this path is the number of non- /-collapsing pairs in (To, ... ,Tg). 
Therefore, we have £ > dist (coll/ (T), coll/ (T')) + Tdist(T, T"). □ 

Remark 4.8. By Lemma l4~5l the inequality (|4.4|) is an equality for dist(T, T') ^ 1. 
This need not be true in general. For instance, let T = (1100) and T' = (0011). 
We saw in Figure Q~2] that the distance between T and T" is 4. Now, we have 
coll {4:5} (T) = coll {4:5} (T') = (11), hence dist(coll {4i5} (T),coll { 4 :5} (T')) = 0. On 
the other hand, it can be checked on Figure [12] that there exists a path from T 
to T' in Ki that contains only two {4, 5}-collapsing pairs, namely 

t — XX ► X s ► — v 

{4,5}- coll. A {4,5}-coll. A A {4,5}-coll. A {4,5}-coll. 

C0ll {4 5} C0ll {4 5} C0ll {4 5} C0ll {4 5} C0ll {4 5} 

■4* >4* 4- "4* >4* 

> A^ = X s = A^ > S\ 

so we have {4, 5}-dist(T, T') < 2 (actually = 2). 

4.3. Double collapsing. Technically, it will be convenient to use two collapsings 
at a time, with respect to sets that are strongly disjoint in the following sense. 

Lemma 4.9. Assume that I, J satisfy the condition 

(4.5) Vie/Vje J ( [i,j] £/u J). 

Then, for all trees T, T' , we have 

(4.6) dist(T, T') > dist (coll/ (T), coU/(T') + 7-dist(coll,/(T), coUj(T')). 
Proof. We claim that the inequality 

(4.7) /-dist(T, T') > /-dist (coll j (T), collj(T')) 

holds for all trees T,T'. Indeed, let (T ,...,T e ) be a path from T to T". Then 
(coll j (To), ... , collj(T^)) is a (possibly redundant) path from collj(T) to collj(T^). 
Put I — J-dist (coll j(T), collj(T')). By definition of /-dist, there must be at least 
I pairs (T r ,T r+ i) satisfying 

(4.8) collj(T r )) ^ collj(T r+ i) and coll/ (coll,/ (T r )) = coll/ (coll,/ (T r+1 )). 

As coll/ (coll j(— )) = coll/u,/(— ) always holds, (|4.8[) means that (T r ,T r+ i) is not 
J-collapsing, and is (/ U J)-collapsing. Now Condition (|4.9[) implies that every 
interval that is included in / U J is included in /, or is included in J. Owing to the 
criterion of Lemma 1431 we deduce that (T,.,T r+1 ) is /-collapsing, and, therefore, 
we have /-dist(T, T') > I. 

Then, using (|4.7|) . we obtain 

dist(T, T') > dist(coll/(T), coll/(T')) + J-dist (T, T') 

^ dist (coll/ (T), coll/(T")) + J-dist (coll j(T), coll jiT')), 

which is the expected inequality (|4.6[) . □ 
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4.4. Application: a lower bound in 5ra/3 + 0(l). Lemma |4~T1 provides a natural 
method for establishing a lower bound on the distance dist (T, T) in an inductive 
way: if dist(T,T') ^ I is known, and we can find a set / satisfying collf(T) = T 
and coll/ (T") = T , then it suffices to show that the minimal number of /-collapsing 
steps from T to T' is at least k to deduce dist(T, T') ~^£ + k. We shall now apply 
this principle to deduce from the results of Section [3T6l which provide a family with 
distance 3n/2 + 0(1), a new family achieving distance 5n/3 + 0(1). 

Proposition 4.10. For T = (P0 P P), T = (0 P (10) P ) with p ^ 1, we have 
(4.9) dist (T, TO >5p-4. 

As T and T' above have size 3p, we deduce 
Corollary 4.11. For n = 3 (mod 3), we /«aue d(n) ^ |n — 4. 




FIGURE 13. The trees of Proposition 14.101 — here with p = 4 — and the 
proof of the latter from Lemma [4.121 collapsing / (light grey labels) leads 
to 2-combs, whose distance is known; collapsing J (dark grey labels) leads 
to trees that are, up to shifting the labels, those of Lemma [4.121 

To prove Proposition 14.101 we shall use Proposition 12.231 (or 13 . 18[) and a con- 
venient collapsing. Fix some p, and let T and T be the coresponding trees of 
Proposition [221 namely (P0 P ) and (0 P F>). Collapsing T into T is easy: T is a 
zigzag of three alternating length p combs ("tricomb"), whereas T is a zigzag of 
two alternating combs ("bicomb"), so that we can project T to T by collapsing all 
labels from p + 2 to 2p + 1 . It then turns out that collapsing the same labels in T' 
leads to T . By Proposition 12. 23[ the distance of T and T is 3p — 2, so, owing to 
Lemma I4.7[ in order to establish Proposition 14. 101 it is enough to prove 

(4.10) [p + 2, 2p + l]-dist(T, TO ^ 2p - 2, 

i.e., to prove that each sequence of rotations from T to T' contains at least p — 2 
pairs that are [p + 2,2p + l]-collapsing. 

Instead of working with the trees of Proposition 14.101 themselves, it will be more 
convenient to use a second, auxiliary collapsing, and to use Lemma 14.91 We shall 
prove: 
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Lemma 4.12. For T = (P01 p ), t' = <(01) p 0) with p^l, we have 
(4.11) [p + 2,2p+l]-dist(T,T"') > 2p-2. 

Proof of Proposition ^. lD\ from Lemma \4.12\ Let I = [p + 2, 2p + 1] and 
J = [2f> + 3, 'dp + 1]. Then the sets 7 and J satisfy Condition (|4.6p . and we have 
T = coll j (T) and T' = coll. / (T / ). Lemma SU then gives 

dist(T,T') ^ dist(coll/(T),coll/(T')) + /-dist(T, T~'). 

As can be checked on Figure Q2J we have collr(T) = (1 P P ) and (0 P 1 P ). Using 
Proposition 12 . 231 and (|5.3p , we deduce 

dist(T, T') > (3p - 2) + (2p - 2) = 5j? - 4, 

as expected. □ 

Proof of Lemma \4-12\ The argument is similar to the one used for Proposition l3.18l 
with the additional difficulty that we need pairs that are [p + 2, 2p + l]-collapsing. 
Put n = 2p + 1, I = [p + 2, 2p], and, for p + 2 ^ a < 2p, say that a base pair is 

-special of type l a if it is ( ... , a , a , ... ) + 

-special of type 1+ if it is ( ... , ^p + 1 , a , ^n) + 

-special of type I~ if it is ( ^.p , ... , + 1 , a)~ 

-special of type M a if it is ( ... , a + 1 , ... , n + 1 ) — . 

It is straightforward that a special pair has a unique type, and that all special 
pairs are /-collapsing: with obvious notation, we have [^2,^3] Q I for type I Q , and 
[1/3 + 1, 1/4] C I for types 1+ , E~ , and IH Q , which is enough to conclude using the 
criterion of Lemma 14.51 

Let (To, ■■■ 7 Ti) be a path from T to T in K^. Choose a in [p + 2, 2p\. First, as 
can be read on Figure [T3l we have 

a — 1 ^tjt a and a — 1 a. 

Lemma 13.131 guarantees that (To, ... , Ti) contains a pair ( — 1 , >a — 1 , a , ... ) + , 
i.e., a pair of type I a . 

Next, let a' = n + 1 — a. Then we have 2 ^ a' ^ p, and we read the relations 

a' a, a' a, and a' + 1 a. 

Applying Lemma 13.161 with i — a! , j — a, and k — p + 1 guarantees the existence 
of r such that (T r ,T r+ i) is 

(<a', a'<...<p+l, a, ...)+ or ( ^a' , a'<...<p+l, + 1 , a)~. 

In the latter case, we have a special pair of type H~. In the former case, we have 
a special pair of type 1I„ provided the last parameter, namely i>4(T r ,T r +i), is at 
most n. Now assume this is not the case, i.e., we have VA{T ri T r +i) = n + 1. By 
Lemma I3.10( w) , we have a + 1 >y n + 1 , hence a + 1 > j. n + 1 asa+1 ^ n holds 
by hypothesis. But, by hypothesis, we have a + 1 \fi* T n + 1. By Lemma[3T4j there 
must exist s ^ r such that (T s , T s+ i) is ( ... , a + 1 , ... , rl + 1 ) — , hence special of 
type IH a . 

Hence, for each of the p—1 values p + 2, ... , 2p, the path (To, ... , T£) contains a 
pair of type I a , and a pair of type , IT , or IH a . Hence (To, ... , Tg) contains 
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p — 1 pairs of type I, 

p — 1 pairs of type or II, 

hence at least 2p — 2 pairs that are /-collapsing. □ 

Remark 4.13. The previous result is optimal. It is not difficult to construct an 
explicit path of length hp — 4 from T to T', and to check that its projection is a 
path of length 2p — 2 from T to T , all of which steps are /-collapsing. Also, one 
can observe that, in the above proof, the final argument showing the existence of a 
special pair of type n a also shows the existence of a pair (...,a+l,...,n-|-l) + . 
It follows that each path visiting a vertex satisfying a >* n + 1 for some a ^ p + 3 
contains at least 2p — 1 pairs that are /-collapsing. Hence such a path has length 
at least bp — 3 and it is not geodesic: moving n + 1 to the right is never optimal. 

4.5. Alternative families. To conclude this section, we mention still another fam- 
ily witnessing a growth rate of the form 3n/2 + 0(1). The analysis of this family, 
which heavily uses collapsing, is easier than that of Proposition I3.18[ but, con- 
trary to the arguments developed above, it does not seem to extend for proving 
stronger results. The trees we consider are zigzags, with a slight change near the 
ends — though seemingly minor, that change modifies distances completely. 

Proposition 4.14. For to ^ 0, we have 

(4.12) dist((l(10) m 0), (0(01) m l)) = 3m + 1. 




m+2 m+3 m+l m+2 

FIGURE 14. The trees of Proposition 14.141 — here with m = 5: collapsing 
I (light grey labels) from T m and T' m leads (up to a relabeling) to T m -2 
and T' m _2, while collapsing J (dark grey labels) leads (up to a relabeling) 
to T 2 and T^. 

Proof. Let T m = (l(10) m 0) and T' m = (0(01) m l). We use induction on m. For 
to 1, the result is easily checked by a direct computation. Assume m 2. Put 

Im = {to, to + 1 , to + 3, to + 4} and J m = [1, m — 2] U [m + 6, 2to + 3] . 

As can be read on Figure HU we have 

coll /m (T m ) =T m _ 2 and coU/ m {T'J = T' m _ 2 , 
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as well as 

coll, /m (T m ) = T 2 and coll Jm (T' m ) = T' 2 . 

The sets I m and J m satisfy the disjointness condition (|4.5[) . Applying Lemma l4~7l 
we deduce 

dist(T m , T,'„) > dist(T m _ 2 , T/ n _ 2 ) + 7 2 -dist(T 2 , T 2 '). 
A brute force verification — or a proof using the techniques of Section 13.41 — gives 

Z 2 -dist(r 2) T^) = 6, 

and dist(T m , T^) > 3m + 1 follows inductively. The other inequality, whence (|4. 12[) . 
are easily checked by a direct computation. □ 

The remarkably simple proof of Proposition l4.14l rclies on the conjunction of two 
properties. First, the two families of trees we consider are stable under two types 
of collapsing simultaneously: whether we collapse from the top or from the bottom, 
we can manage to remain in the same family. Second, the involved collapsing are 
perfect, in the sense that the inequality (|4.4[) turns out to be an equality, a necessary 
condition if we are to obtain an exact value. 

It is not difficult to obtain other families satisfying one of the above two proper- 
ties. In particular, for each p ^ 1, the thin trees (P(10) m (P) and ((P(01) m l p ) are 
eligible. Proposition 14. 141 then corresponds to p = 1. The choice p = is uninter- 
esting, but p ^ 2 gives seemingly large distances. Applying the scheme above leads 
to looking for the unique parameter 

([p + 1, 3 P + 3] \ {2p + 2})-dist((P(10) p+1 (F), (0 p (01) p+1 P)), 

which we have seen is 6 for p = 1. For p = 2, one obtains the value 10, but it is 
then easy to see that such a value cannot give an equality in (|4.4|) . In this way, one 
obtains lower bounds for the distances of the involved trees, but these bounds are 
not sharp, and it seems hard to obtain very strong results. For instance, for p = 2, 
collapsing six leaves guarantees ten collapsing steps, and one cannot obtain more 
than 5n/3 + (3(l). 

5. A LOWER BOUND in 2n + 0(^/n) 

It is not hard to repeat the argument of Section 14.41 so as to construct explicit 
trees of size n at distance 7n/4 + O(l). Unfortunately, a furher iteration seems 
difficult and, in order to go farther, we shall have to develop a more intricate 
argument — yet the principle always remains the same. The main difference is that, 
now, we will collapse a size 2p bicomb rather than a size p comb. 

5.1. Starting from an m-comb. The main result of this section is analogous to 
Proposition 14 . 1 Ol but the source tree is a 2m-comb rather than a tricomb, i.e., it is 
a tree obtained by stacking 2m left and right combs, alternately. The target tree 
is again a zigzag preceded by a short left comb — see Figure [T5l 

Proposition 5.1. For m,p ^ 1, we have 

(5.1) dist(((P0 p ) m ), (0 p (10) (m - 1)p l p » > Amp - 3m -p + 1. 

By letting m stay fixed and p vary, we obtain size n trees whose distance grows at 
least — actually, one can check that (15 . 1 j) is an equality — as (4m— l)n/(2m) + 0(1). 
By letting m and p vary simultaneously, we obtain a lower bound in 2n + 0{y/n): 
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rap mpfl rap mpYl 



FIGURE 15. The trees of Proposition 15.11 — here with m = 3 and p = 4 — 
and the proof of the latter from Lemma [B~3l collapsing / (light grey labels) 
leads from (T m>p ,T^ tP ) to {T m -i, p , Tm-i,p)l collapsing J (dark grey labels) 
leads from (T m> p,T^ p ) to the pair (T p ,T p ), which does not depend on m 
and is, up to shifting the labels, the pair of Lemma [5~3l 



Corollary 5.2. For n of the form 2m 2 , we have 
(5.2) d(n) > 2n + 1 - 2 V2^. 
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Moreover, d(n) ^ 2n — Cy/n holds for each n with C = \/70. 

Proof (of Corollary [5jH from Proposition \5.1\) . Assume first n = 2m 2 . Then, choos- 
ing p = m and using (|5.ip , we find 

d(n) = d{2mp) dist(((l p O p ) m ), (0 p (10) (m ~ 1)p l p )) = 4m 2 - 4m + 1, 

whence (|5.2p . For the general case, choose m = \ L ^/hnJlA\ , p — |_\/7n/10j . Then 
we have 2mp ^ n, whence 

d(n) > d{2mp) > dist(((l p p ) m ), (0 p (10) (m - 1)p l p )) = 4mp - 3m - p + 1. 

by (|5.ip . Using m ^ ^/5n/14 < m + 1 and p < ^/7n/10 < p+ 1, one obtains 

4mp - 3m - p + 1 > 2n - (7y/5n/U - 5^7^/10) + 5 = 2n - V70n + 5. □ 

The proof of Proposition 15.11 uses an induction on the parameter m, i.e., on 
the number of alternations in the zigzag-tree ((1 P P )™ 1 ). It is not hard to find 
how to collapse T m , p to T m _i p and T' m v to T^_ lp . Then the key point consists 
in identifying sufficiently many collapsing pairs, and this is done in the following 
result. 

Lemma 5.3. Let T = (1 P 01 P P ), T' = (0(10) p l p ), and I = [p + 2,2p+l]. Then, 
for each p 1, we have 

(5.3) 7-dist(T,T') >4p-3. 

Proof of Proposition I5.il /rom Lemma \5.3\ The argument is exactly similar to the 
one used to deduce Proposition ^. 101 from Lcmma r4.12t We use induction on m ^ 1. 
The trees T hp and T{ p are the 2-combs (1 P P ) and (0 P 1 P ), and Proposition [233] 
(or !3.18p gives dist(Ti )P , T{ p ) ^ 3p ~ 2, in agreement with (|5.ip . 
Assume now m ^ 2, and let 

I = [(m- l)p + 2, (m+l)p+l] and J = [1, (m — 2)p] U [(m + l)p + 3, 2mp + 1]. 

Then / and J satisfy the disjointness condition of Lemma 14. 9[ and we read on 
Figure [T5] that /-collapsing maps T mtP to T m — ij» and to T' m _ l p . On the 

other hand, J-collapsing maps T m , p to a tree T p that is, up to shifting the labels, 
the tree T of Lemma [5~3l and T' m to a tree T p that is, up to shifting the labels, the 
tree T 1 of Lemma 15.31 Applying Lemma 14.91 the induction hypothesis for m — 1 , 
and Lemma 15.31 we obtain 

dist(T' mj p, T mp ) 

> dist (coll/ (T m , P ) , coll/ (T^ p )) + 7-dist (coll/ (T m , p ) , coll/ (T^)) 
= dist(T m _i )P , T' n _ lp ) + 7-dist(T p , T p ) 

> (4(m-l)p- 3(m-l) - p + 1) + (4p - 3) = 4mp - 3m-p+ 1, 
which is (|5~Tj) . □ 

5.2. Special pairs. The proof of Lemma 15.31 will occupy the next sections. It is 
parallel to the proof of Lemma l4.12[ but it is more involved and requires some care, 
mainly because we are to introduce many different types of special pairs. 

In the sequel, we put n = 3p+ 1 and q — 2p+ 1. Then T and T" are size n trees, 
displayed in Figure [TB] — compared with the trees T p and T p of Figure the only 
difference is that the labeling has been standardized. 



ON THE ROTATION DISTANCE BETWEEN BINARY TREES 



31 




9 9+1 2p q 



FIGURE 16. The trees of Lemma [5731 here with p — 4, and, in grey, the 
labels of J, which are those for which we need to count the /-collapsing 
pairs. 

Then we consider the following eleven (!) families of base pairs. We say that a 
base pair is 



-special 


of type 


1+ 


if it 


is ( 


... , a, q, ... )+ 


with 


P- 


-2 




a 




q 


-special 


of type 


1,7 


if it 


is ( 


... , a, . 


.,?)" 


with 


P- 


-2 




a 


< 


q 


-special 


of type 


1+ 


if it 


is ( 


... , q, a 


,...)+ 


with 


?H 


-1 




a 


< 


n 


-special 


of type 


1,7 


if it 


is ( 


q, a 


,...)" 


with 


<H 


-1 




a 


< 


n 


-special 


of type 


K 


if it 


is ( 


a, <q, ...)+ 


with 


?H 


-2 




a 


< 


<1 


-special 


of type 


m- 


if it 


is ( 


... , a, . 


., n + l)~ 


with 


JH 


-2 




a 




q 


-special 


of type 


K 


if it 


is ( 


>v- 


f 2&^ g , a, ...)+ 


with 




-1 


< 


a 




n 


-special 


of type 


vl 


if it 


is ( 


.... <p- 


f 1, a, 


with 




-1 




a 


< 


n 


-special 


of type 


V7 


if it 


is ( 


»., <P 


f 1 , >p, a)^ 


with 


<H 


-1 




a 


< 


n 


-special 


of type 


VT+- 


if it 


is ( 


... , a + 


1, n + l)+ 


with 


5H 


-1 




a 


< 


n 


-special 


of type 


vi- 


if it 


is ( 


... , a + 


1, ...,n + l)" 


with 


SH 


-1 




a 


< 


n 



Claim 1. Every special pair is I-collapsing. 

Proof. As we consider size n trees, every base pair satisfies v± ^ n+l, hence 
1/3 ^ n and, thereforen vj, £ /. For all special pairs except those of type V, we have 
V2 P + 2, hence [V2, ^3] Q I- For the special pairs of type V, we have 1/3 ^ p + 1 
and 1/4 ^ n, hence [1^3 + 1 , 1/4] C. I. In all cases, the criterion of Lemma 14.51 implies 
that the pair is /-collapsing. □ 

Claim 2. A special pair has a unique type, except, for q + l^b<a<n, 

- the pairs (...,6+1, a , n + l ) + , which have type both IVT7 and Vljj", 

- the pairs ( q , 6+1, a , n + 1 ) — , which have type both H~ and Vlr . 

Proof. A positive pair cannot coincide with a negative one, so we can consider 
positive and negative pairs separately. Next, for each type r a , the value of a can 
be recovered from one of the parameters Uk of the pair, hence it is impossible that 
a pair of type T a be of type t& for some 6 ^ a. So it remains to check that, for each 
pair (r a , T' b ) of distinct types (with the same sign), it is impossible that a special pair 
be simultaneously of type r Q and T* h — up to the exceptions mentioned in the claim. 
This is done in the two arrays below. As should be clear, "vi>qj vi^xp means that 
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a pair cannot be of the considered two types r, r' simultaneously because, for being 
of type r, we must have v 2 > q whereas, for being of type r', we must have v 2 ^ <?• 



n 


K 


K 


K 










V3=q/v3<q 


V3=q/v 3 >q 


V3=q/"3>q 


V3=q/v3>q 


*i 




V2=q/v2<q 


v 2 =qjv 2 ^q 


is 2 =q/v 2 ^P + 1 


V2=qjv2>q 


K 






vz<q/v 3 >q 


V3<q/v 3 >q 


v 2 <q/v 2 >q 










v 2 ^p + 2/i/ 2 <p + 1 


possible 















n 



n,: 



vr 



vir 



V2^q/v 2 >q 



v^n/v±=n + 1 



z/ 2 >P + 2/i/ 2 <p + 1 



Vi=q/vA=n + 1 



v\=q/vi<q 



possible 



^2^P + 2/v 2 ^p + 1 



V2^q/v2>q 



So all cases have been considered. 



□ 



We shall now exploit the differences of covering and co-covering between T 
and T". First, we use the fact that p + 1 to q — 1 are not covered by g in T, 
whereas they are in T", and, symmetrically, the fact that q co-covers q + 2 to n 
in T, whereas it does not in T' . 

Claim 3. Every path from T to T' 

- contains a pair of type for each a in [p + 2, q], 
and - contains a pair of type for each a in [q + 1, n — 1]. 

Proof. Assume a G [p + 2, q]. Then we have p+l^a— l^S 2p, and, as can be 
read on Figure 1161 we have 

a — 1 q, a — 1 <t' 9, and a >* T , q. 

Applying Lemma |3 . 1 51 with i = a — 1 and j = q shows that every path from T to T' 
contains a pair ( ... , a , q , ... ) + , of type I+, or a pair ( ... , a , ... , q )~, of type I~. 

Assume now a G [<? + 1, n — 1]. Then we have g + 2^a+l^n, and, as can be 
read on Figure 1161 we have 

q t>* T a + 1, q <t a, and q \ji-* T , a + 1. 

Applying Lemma 13.171 with i = q and j = a + 1 shows that every path from T 
to T' contains a pair ( ... , q , a , ... ) + , of type 1^" , or a pair (q , ... , a , ... )~, of 

type Iq . □ 

Remark 5.4. The second argument above also applies to a = q, but then it 
possibly leads to a pair (..., q , ... ) + that would be of type Lj and may have 
been already considered in the first argument. 

We shall now exploit the fact that p + 1 is not covered by any label from q + 1 
to n — 1 in T, and is covered by these labels in T. 

Claim 4. Every path from T to T' 

- contains a pair of type for each a in [p + 2, q — 1], 
or - contains a pair of type TVb for each b in [q + 1, n — 1]. 
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Proof. For p = 1, the result is vacuously true, and we assume p ^ 2. Let (To, ... , Tg) 
be any path from T to T'. Assume that, for some b in [q + l,n — 1], there exists 
no pair of type IVj, in (T , ... , T). As we read on Figure [TC] the relations 

p + 1 ^It & and p + 1 <t' &j 

applying Lemma 13.131 with i = p + 1 and j = b shows that there must exist r 
such that (T r , T r +i) is ( /i , g , , ... ) + for some g, h satisfying h p + 1 and g 
p + 2. The hypothesis that this pair is not of type IV& implies g — q. As p ^ 2 
holds, we have /i ^ p + 1 < q — 1 and, therefore, Lemma I3.10f z) implies /i <T r 
<7 — 1. Let a be any element of [p + 2, q — 1]. By Lemma 13.61 ft- <r r g — 1 implies 
a — 1 <jy r g — 1. On the other hand, we see on Figure [TBI that a co-covers q in T. 
Applying Lemma 13.171 with i = a and j — q shows that (To, ... , T r ) contains a pair 
(..., a, <q, of type H+ □ 

The next result uses the fact that each label a between 2 and p is not covered 
by n + 1 — a in T, whereas it is in T". The possible interference of the label n + 1 
makes the result slightly more complicated — as was already the case in the proof 
of Lemma 14.121 This step is the most delicate one, as it requires the full power of 
Lemma 13.161 and not only Lemma l3.13l or Lemma 13.151 

Claim 5. For each a in [q + 1, n — 1], every path from T to T' 

- contains a pair of type , 
or - contains a pair of type VI^ and a pair of type VL~ . 

Proof. Let (To, ... , Tg) be a path from T to T", and let a be an element of [q + 1, n — 1], 
Put a! = n + 1 — a. Then we have 2 sC a' ^ p, and we read on Figure[l6]the relations 

a' ■fix a i a ' <lr' a, and a' + 1 a. 

Applying Lemma 13. 161 with i = a', j = a, and k = p + 1 shows that there exists r 
such that (T r , T r+ i) is ( ^a' , a'<...^p + 1 , a , ... ) + or ( ^a' , a'<...^p + 1 , >p , a )~. 
The latter pair is of type . As for the former one, two cases are possible: if 
Vi(T r ,T r+ i) ^ n holds, then (T r ,T r+ i) is of type Vjj~, else we necessarily have 
v±{T r , T r+ i) = n. Then (T ri T r+ i) is a pair (...,..., a , n + 1 ) + , in which case, by 
Lemma I3.10f zf ), we have a + 1 >^ n + 1, and even a + 1 n + 1 as a < n 
is assumed. Now, we read on Figure [Jj)] that a + 1 co-covers n+1 neither in T 
nor in T'. Applying Lemma 13.141 with i = a + 1 and j = n + 1 guarantees that 
(To, ... , T r ) contains at least a pair ( ... , a + 1 , ... , n + 1 )+ of type VI~, and that 
(T r , ... ,Tg) contains at least a pair ( ... , a + 1 , ... , n + 1 ) + , of type VLJ . □ 

The last claim of the series will be used to cope with the possible interference 
between types I~ and VP. 

Claim 6. Assume that T* is a size n tree and q co-covers n+1 in T„. Then every 
path from T to T* contains 

- a pair of type for each a in [p + 2, q] . 

Proof. Let T = co\l [q+hn] {T) and T* = coll [9+ i, n] (T*). Then we have T = (OP), 
and, by Lemma 14.31 the hypothesis that q co-covers n+1 in T, implies that q 
co-covers n+1 in T*. Let a be any element of [p + 2, q]. By Lemma 13.81 a — 1 
cannot be covered by n in T,. On the other hand, a — 1 is covered by n in T, 
and, moreover, a co-covers q in T. Applying Lemma 13.151 to (T*,T) with i = a 
and j = q shows that every path from T to T* contains a pair (..., a, n+1) - 



34 



PATRICK DEHORNOY 



or ( ... , a , ... , q ) + . It follows that every path from T to T* contains a pair that 
projects to a pair of the previous form when [q + 1, n] is collapsed. 

Now, Lemma 14.51 shows that ( a' , b' , d , d')~ projects to ( ... , a , q , n + 1 )~ 
if and only if we have 6' = a, sup([6',c'] \ [g+l,n]) = q, hence d ^ (?, and 
sup([c' + l, d']\[q + l,n]) = n + 1, hence cf = n + 1, implying that ( a' , 6' , c' , d! )~ 
is special of type IH~ . 

Similarly, (a' , b' , d , d! ) + projects to ( ... , a , ... , q ) + for 6' = a, c' = sup([6', c']\ 
[g + 1, ra]) < q, hence c' < q, and sup([c' + 1, d'] \ [q + 1, ra]) = hence d' < n, so 
that ( a' , 6' , c' , d! ) + is special of type HL^~. □ 

5.3. Proof of Lemma l5.3l We are now ready to prove Lemma l5~3l The argument 
is similar to the one used for proving Lemma l4.12l but we have to be more careful 
because of the possible interferences between special pairs of type VI and special 
pairs of other types. It may be noted that such problems never occur when add(n) 
remains l p 01 p+1 throughout the considered path from T to T'\ proving the result 
for the trees coll{ n+1 }(r) and collj n+1 } (T") would be easier. 

Proof of Lemma [inK Let (To, ... ,Ti) be any path from T to T'. We have to show 
that this path contains at least Ap — 3 special pairs. The latter can correspond to 
several combinations of types, and we consider three cases. 

Case 1: (To, ... , T) contains a pair of type for each a in \p + 2, q — 1]. 
Then (To, ... ,T) contains at least 

p pairs of type I ± by Claim 

p — 1 pairs of type I ± by Claim [3l 

p—1 pairs of type IH + by hypothesis, 

p—1 pairs of type or VI + by Claim \S\ 

hence at least 4p — 3 special pairs, which are /-collapsing by Claim [TJ and are 
pairwise distinct by Claim [51 since we appeal to no pair of type IW (which could 
intcrfer with VI + ) or VP (which could interfer with I - ). 

Case 2: There exists a in \p + 2, q — 1] such that (To, ... , T) contains no pair of 
type IH^, and there is no r such that q co-covers n + 1 in T r . 

Then Claim [2] implies that (To, ... , Tg) contains no special pair that is simulta- 
neously of types I~ and VI - . Indeed, if a pair is both of types I~ and VI^", it has 
the form ( q , ... , ... , n + 1 )~, and, therefore, by Lemma l3.10f z). q co-covers n + 1 
in the two trees of that pair. Then (To, ... , Tg) contains 

p pairs of type I ± by Claim [31 

p—1 pairs of type I ± by Claim [3J 

p — 1 pairs of type by Claim [Hand the hypothesis, 



p- 



1 pairs of type or VI by Claim [5) 



and we have again Ap — 3 special pairs, which are /-collapsing by Claim [TJ and 
are pairwise distinct, since we appeal to no pair of type VI + (which could inter- 
fer with IV^), and interferences between type 1I~ and VI - are discarded by our 
hypotheses. 

Case 3: There exists a in \p + 2, q — 1] such that (To, ... , Tg) contains no pair of 
type IH^, and there exists r such that q co-covers n + 1 in T r . 
Then (T , ... ,Tg) contains 
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p pairs of type P 1 by Claim [31 

p — 1 pairs of type I ± by Claim [31 

p — 1 pairs of type IH ± by Claim 

p — 1 pairs of type IV 4 " by Claim [5] and the hypothesis. 



We still have found 4p — 3 special pairs, all /-collapsing by Claim [U and pairwise 
distinct by Claim [21 since we appeal now to no pair of type VI ^ . □ 

So the proofs of Lemma 15.31 and, therefore, of Proposition 15. 1[ are complete, 
yielding the exepected lower bound d{n) ^ 2n — 0{yfn) on the diameter of the nth 
associahedron. 

5.4. Going further. Proving the conjectured value d(n) = 2n — 6 for n > 10 
using the above methods seems feasible, but is likely to require a more intricate 
argument. Experiments easily suggest families of trees that should achieve the 
maximal distance, namely symmetric zigzag-trees with small combs attached at 
each end, on the shape of the trees T' m 3 of Proposition [CT] 

Conjecture 5.5. Define 

ji _ / (Hl(01) p - 3 00) T , = f (000(10)f- 3 ll> for n = 2p - 1, 
" |(lll(01) p - 3 000) " j(000(10) p - 3 lll) forn = 2p, 

Then one has dist(T„, X^) = 2n — 6 for n 11. 



The problem for establishing Conjecture 15.51 is that counting /-collapsing pairs 
cannot suffice here: various solutions exist for projecting (T„, T^) onto (T n _2, T' n _ 2 ) 
by collapsing two labels, but the minimal number of collapsing pairs is then 3, and 
not 4, as would be needed to conclude. On the other hand, the highly symmetric 
shape of the trees T n and T' n allows for other arguments that will not be developped 
here. 
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